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Abstract — The leakage power can dominate the system power 
dissipation and determine the battery life in battery-operated 
applications with low duty cycles, such as the wireless sensors, 
cellular phones, PDAs or pacemakers. Driven by the need of 
ultra-low power applications, this paper presents single ended 
6T SRAM (static random access memory) cell which is also 
radiation hardened due to maximum use of PMOS 
transistors. Due to process imperfection, starting from the 65 
nm technology node, device scaling no longer delivers the 
power gains. Since then the supply voltage has remained 
almost constant and improvement in dynamic power has 
stagnated, while the leakage currents have continued to 
increase. Therefore, power reduction is the major area of 
concern in today's circuit with minimum-geometry devices 
such as nanoscale memories. The proposed design in this 
paper saves dynamic write power more than 50%. It also 
offers 29.7% improvement in T WA (write access time), 38.5% 
improvement in W PWR (write power), 69.6% improvement in 
Wedp (write energy delay product), 26.3% improvement in 
Wedp variability, 5.6% improvement in /? PW r (read power) at 
the cost of 22.5% penalty in SNM (static noise margin) at 
nominal voltage of V BB = 1 V. The tighter spread in write EDP 
implies its robustness against process and temperature 
variations. Monte Carlo simulation measurements validate 
the design at 32 nm technology node. 

Index Terms — Random dopant fluctuation (RDF), line edge 
roughness (LER), read access time, static random access 
memory (SRAM), drain induced barrier lowering (DIBL), 
short channel effect (SCE) 

I. INTRODUCTION 

Due to aggressive scaling, fluctuations in device 
parameters are more pronounced in minimum-geometry 
devices commonly used in area-constraint circuits such as 
SRAM (static random access memory) cells [1]. SRAM is 
a highly used circuit of modern chips. SRAM constitutes 
more than half of chip area and more than half of the 
number of devices in modern designs [2]. As per ITRS 
prediction, embedded cache will occupy 90% of an SoC 
(system-on-chip) by 2013 [3]. Even today, an H-264 
encoder for a high-definition television requires, at least, a 
500 kb memory as a search- window buffer that contributes 
40% to its total power dissipation [4]. The SNM (static 
noise margin) model in [5] assumes identical device 
threshold voltages across all cell transistors, making it 
unsuitable for predicting the effects of threshold voltage 
mismatch between adjacent transistors within a cell. 
Therefore, designer will require reinvestigation and 
analysis of SNM in scaled technologies to ensure stability 
of SRAM cell. 



Low power operation is a feature that has become a 
necessity in today's SoCs and uPs (microprocessors); 
hence, power consumption of SRAM modules must be 
reduced and has been under extensive investigation in the 
technical literature [6]. However, with the aggressive 
scaling in technology, substantial problems are encountered 
when the conventional 6T (six transistors) SRAM cell 
architecture is utilized. Therefore, extensive research is 
carried on designing SRAM cells for low power operation 
in the deep sub-micron/nanometer regimes [7-9]. The 
common approach to meet the objective of low power 
design is to add more transistors to the original 6T cell. An 
8T cell can be found in [7]; this cell employs two more 
transistors to access the read bitline. Two additional 
transistors (thus yielding a 10T cell design) are employed 
in [8], [9] to reduce the leakage current. However, these 
approaches increase the cell area and thus not suitable for 
area constraint cache design. 

The objective of this paper is to propose a new SRAM 
cell that is scalable to small feature sizes such as 32 nm. 
The novelty of this cell is that its design requires only 6 
transistors (6T). The architecture of the cell is different 
from that of conventional 6T SRAM cell which is 
differential in nature whereas the proposed cell is single 
ended. Moreover, the design utilizes mostly PMOS 
transistors to make it radiation-hardened. The simulation 
results of the proposed single ended PFET-based 6T 
SRAM cell (hereafter called SEP-6T) are compared with 
differential 6T SRAM cell (hereafter called Dif-6T) and it 
is demonstrated that the proposed scheme offers significant 
advantages in terms of write dynamic power, write access 
time, write power, write EDP and read power at the 
expense of static noise margin. Moreover, simulation 
results using HSPICE confirm that the proposed SEP-6T is 
suitable for implementation at 32 nm CMOS technology. 

The remainder of the paper is organized as follows. 
Various failure mechanisms are briefly reviewed in Section 
II. Section III briefly discusses impact of scaling - DIBL 
(Drain-Induced Barrier Lowering) and SCE (Short-Channel 
Effect), read stability, and write-ability. Section IV presents 
brief discussion on read/write operation, cell sizing and 
dynamic power saving of proposed SEP-6T. Section V 
explains the simulation measurement results and compares 
proposed design with Dif-6T. Finally, Section VI 
concludes the paper. 
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II. FAIURE MECHANISMS IN SRAM 

Failures in SRAM cell may be of three types - hard 
failures, soft failures and parametric failures. 1) Hard 
failures are caused by open or short. 2) Soft failures - there 
are major three sources of soft failures. They are alpha 
particles released from the radioactive impurities such as 
packaging materials, high energy neutrons from terrestrial 
cosmic radiations and the interaction of cosmic ray thermal 
neutron. 3) Parametric failures - since these failures are 
caused by the variations in the device parameters, these are 
known as the parametric failures. Parametric failures 
include access failure defined as unacceptable increase in 
SRAM cell access time; read failure is defined as flipping 
of cell content while reading; write failure is defined as 
inability to write to a cell and hold failure is defined as 
flipping of the cell state in the standby mode, especially 
when K DD approaches or falls below DRV (data retention 
voltage) [10-13]. 

III. IMPACT OF SCALING AND READ/WRITE STABILITY 

Due to scaling, substantial problems arises while 
designing conventional 6T SRAM cell. Therefore, impact 
of scaling on read stability and write-ability is required to 
be reinvestigated at scaled technology such as 32 nm 
technology node. 

A. DIBL and SCE on Scaled Devices 

It is well known, that the threshold voltage is given by 

V t =V t0 +y(^{2\if F \+V SB )-p\isf F \) (1) 

where V t0 is the threshold voltage at K SB = V and is 
mostly a function of the manufacturing process, difference 
in work-function between gate and substrate material, 
oxide thickness, Fermi voltage, charge of impurities 
trapped at the surface, dosage of implanted ions, etc; V$ B is 
the source-bulk voltage; (|) F = V T \n(N A /ni) is the bulk Fermi 
potential (where V T = kT/q = 26 mV at 300 K is the thermal 
voltage, N A is the doping concentration of acceptor ion of 
substrate, n x is the intrinsic carrier concentration in pure 
silicon); y = V(2q7VA8 S i)/C 0X is the body-effect coefficient 
(where s si is the permittivity of silicon, C ox = 8 0X /t 0X is the 
gate oxide capacitance). 

Equation (1) is suitable for estimating the threshold 
voltage for long channel devices (> 1 urn). It fails to model 
threshold voltage of nanoscaled devices. To model 
threshold voltage for nanoscaled devices many factors such 
as SCE (short-channel effect), RSCE (reverse short- 
channel effect), NWE (narrow-width effect), DIBL (Drain 
Induced Barrier Lowering) effect, etc., are to be taken into 
account. In short-channel devices, the depletion region 
around the drain increases as K DS increases and it extends 
into the channel region. Conceptually, the drain voltage 
assists the gate voltage in the depletion process. Hence, 
V t decreases due to DIBL 



TABLE I. 
DIBL Effect on Threshold Voltage 
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Figure 1. Threshold voltage (V t ) versus drain to source voltage (T D s) of 
NMOS transistor at 32 nm technology node. V t variation due DIBL 



effect. The DIBL effect can be modeled as 



W WO ^DIBL'DS 



(2) 



where V t0 is the threshold voltage at V m = 0V, and n DIB L 
is the DIBL coefficient approximately equal to 0.191 for 
NFET. This is computed and verified by simulation using 
BPTM 32 nm technology. The results are tabulated and 
plotted in Table I and Fig. 1 respectively, which shows 
dependency of V t on K DS . The Fig. 1 particularly, shows 
that the V t is 0.6292 V at V DS = V and the V t drops to 
0.4381VatK DS =l V. 

B. Read Stability 

During correct read operation, the stored data in Q (say, 
storing "1") and QB (storing "0") are transferred to the 
bitline (BL) and bitline bar (BLB) by leaving BL at its 
precharged value and discharging BLB through MN3 and 
MN1 (Fig. 2). A careful sizing of MN1 and MN3 is needed 
to avoid accidental read upset due to building up of Vq B at 
QB. Minimum-sized transistors are necessary to design 
small-sized bitcell, which will result in longer read access 
time (Tra) because of slow discharging of large bitline 
capacitance through small- sized transistor (offering higher 
resistance). As the difference between BL and BLB builds 
up, the sense amplifier (not shown) [14] is activated by 
asserting sense enable (SE) high to accelerate the reading 
process. At the point of 10% discharge of BLB, sense 
amplifier is enabled to sense the differential voltage 
between BL and BLB. Full-swing discharge of BLB does 
not take place. Hence, condition K ds ,mn3 > ^gs,mn3 ~ Fyvcsra 
remains valid, which implies that NM3 remains in 
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saturation region and MN1 remains in linear region. 
Boundary condition on MN1 and MN3 sizing to avoid read 
upset is achieved by equating BLB discharging currents 
through MN3 and MN1 as given below 



p 



MN3 



7 DD ~ y QB 



V n „ ~ V 



t,MN3 ) 



where 



- PMNl i VDD Vt,MN\ rQB r> 



P = ^ n C ox (W/L). 



(3) 



(4) 



Equation (3) can be solved for K QB by substituting F t;M N3 
and K t MNi using (2) and Table I. The Cell Ratio (CR) (also 
called p ratio) is defined as 



CR = 



(W/L) A 



(W/L) 



(5) 



MN3 



CR is required to be set appropriately to avoid 
accidental read upset. The Fig. 3 shows the plot of K QB 
versus CR of Dif-6T simulated using MN1/MN3 size ratio 
from 0.1 to 3.5 with minimum-sized transistors (i.e. MP1 = 
MP2 = MN2 = MN4 = 32 nm/32 nm). It is observed from 
Fig. 3, that the voltage rise at QB with CR = 0.1 is 0.3285 
V, whereas V tn ranges from 0.4381 V to 0.6292 V for V DS 
ranging from 1 V to V (Fig. 1) in 32 nm technology node. 
It implies that read upset may not occur due to voltage rise 
while reading, if CR is lowered even up to 0.1. Read 
stability also largely depends on DIBL, which is inherent to 
technology scaling (i.e. SCE). To understand the 
dependence of read upset on DIBL, consider the case when 
Vq B is increasing from 0. The increase in Vq B will increase 
the K ds ,mni, (Drain to source voltage of MN1). It is obvious 
(Fig. 1), that this will reduce the V t of MN1 thereby making 
it stronger. This unexpected mismatch in threshold voltage 
will affect the SNM of the SRAM cell (decrease in V t will 
decrease SNM) [5]. An SRAM cell should be designed 
such that under all conditions some SNM is reserved to 
cope with dynamic disturbances. Therefore, considering 
the trade- off between SNM and cell size, CR is set to 1 for 
the analysis in this paper (i.e. Dif-6T and SEP-6T cells are 
made with minimum-sized devices). It is therefore obvious 
that, neither the proposed design nor its counter part will 
suffer from read upset problem. 
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Figure 2. Standard 6T SRAM cell 
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Figure 3. Voltage ripple (Pq B ) versus cell ratio while reading Dif-6T 



> 



o.s 



0.6- 



m 

£ 

a 
> 



0.4- 



0.2 - 



(3, 0.4S0B)- 
(2.5, 0.3933) 




12 3 

Pull-up Ratio 

Figure 4. Voltage ripple versus pull-up ratio while writing to Dif-6T 

C. Write-Ability 

A reliable write operation is possible if node storing "1" 
(say, Q in Fig. 2) is pulled low enough - below the 
threshold voltage of MN1, so that writing a "1" is possible 
to the other storage node (say, QB) by flipping the cell 
state. Sizing of the transistors MP2 and MN4 should be 
such that this occurs causing flipping of state, failing which 
will cause write failure. The size ratio between MP2 and 
MN4 called pull-up ratio (PR) (also called y ratio) is 
defined as 



PR = 



iWIL) MP2 

(WIL) MN4 



(6) 



Fig. 4 plots the simulated results of Vq versus PR 
with minimum-sized transistors (i.e. MPl = MN1 = MN2 = 
MN3 = 32 nm/32 nm). As observed from Fig. 4, to avoid 
write failure, PR must be chosen such that Vq falls below 
threshold voltage of MN1. As mentioned above, threshold 
voltage V tn = 0.4381 V at V DS = 1 V (Fig. 1) at 32 nm 
technology node. As observed from Fig. 4, Vq = 0.4808 V 
at PR = 3, which implies that PR should be set below 3 to 
avoid write failure. The PR for the proposed design as well 
as Dif-6T in this paper is set to unity which ensures their 
write- ability. 

IV. Single Ended PFET-based 6T SRAM Cell 

Fig. 5 shows the proposed design. It consists of two 
cross-coupled inverters connected in a positive feed-back 
loop through transistor MP5. Unlike, conventional 
differential 6T SRAM cell, only one PMOS transistor MP3 
is used as access transistor for accessing the cell. 
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A. Read/Write operation and Cell sizing 

Writing as well as reading takes place only through 
BLB. Thus, single ended writing and sensing (reading) 
scheme is required for the proposed design. All the 
transistors (four PMOS and two NMOS) are of minimum- 
sized (each 32 nm x 32 nm) to achieve smallest cell area. 
PMOS transistors are used as major devices to get the 
benefit of its higher radiation tolerance. Leakage currents 
in NMOS transistors increase after radiation bombardment, 
but leakage currents in PMOS devices remain almost 
unaffected [15]. Excessive leakage in access transistor may 
corrupt the stored data in storage node of the cell. 
Therefore, usage of PMOS transistors will improve the 
hold failure probability of the SRAM cell. Moreover, 
PMOS devices have an order of magnitude smaller gate 
leakage than NMOS transistors [16]. 

Before and after each write operation BLB is 
precharged low, by asserting PCL (pre-charge low) high at 
the gate of an NMOS whose drain and source are 
connected to BLB and GND (not shown). Write operation 
of SEP-6T is initiated by switching MP5 off applying W = 
"1" (in other time, i.e., read and hold time MP5 remains 
on). This breaks the feedback path and makes writing 
easier. In the literature single ended 5T SRAM cell is found 
[17], which exhibits problem while writing "1" and needs 
write assist method. BLB carries the complement of the bit 
to be stored in storage node Q (or equivalently BLB carries 
the bit to be stored at storage node QB). On application of 
WL = "0", bit applied on BLB gets complemented and 
stored at Q. This bit in turn drives INV1 (inverter 1) and 
finally a bit applied at BLB gets stored at QB. Thus, write 
operation involves two inverters' delay. As MP3 passes a 
bad "0", at the end of write "0" operation at QB, K Q2 is at 
higher potential than GND (QB). This is evident from Fig. 
6, which shows various node voltages while writing "0" @ 
QB. Just after write operation, when W is pulled down in 
hold mode, MN2 may partially conduct causing short- 
circuit current to pass from K DD to GND through MP2. 
This problem can be mitigated in two ways; 1) by using 
low- F t (absolute value) MP5 to reduce voltage difference 
between Q2 and QB and 2) using high-K t MN2 to reduce 
the possibility of it being on due to residual K Q2 . Zero bias 
threshold voltage (K tn0 ) of NMOS is 0.63 V at BPTM 32 
nm technology. It is already quite high compared to zero 
bias threshold voltage (Fjp ) of PMOS. Therefore, in this 
paper low-F t (absolute value) MP5 is used (-0.3 V in spite 
of -0.5808 V). Extensive simulations are performed to 
obtain optimum results by selecting different V t s. Optimum 
results in terms of various design metrics are obtained with 
the V t s as shown in Table II. During write "0" to QB 
operation, BLB is precharged initially to GND but not 
charged. During write "1" operation at QB, BLB is initially 
precharged to GND and then charged by the write circuit 
(not shown). This reduces the switching activity factor, 
which helps in saving dynamic power dissipation. The 
write "1" to QB operation is illustrated in Fig. 7, which 
shows various node voltages during write operation. For 
the single ended read operation an inverter can be used to 
sense the BLB, but other single ended sensing mechanism 
such as charge re-distribution amplifier could improve read 
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speed [14]. In this analysis, read operation is simulated 
with QB holding "1". This is illustrated in Fig. 8, which 
shows various node voltages while reading with QB 
holding "1". 
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Figure 5. Proposed single ended 6T SRAM cell (SEP-6T) 
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Figure 6. Voltage transfer curve (VTC) of various nodes while writing 
"0" @ QB 

TABLE II. 
Threshold voltage for Optimum results 
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Figure 7. VTC of various nodes while writing "1" @ QB 
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Figure 8. VTC of various nodes while reading with QB storing "1" 

B. Dynamic Power Saving 

The dynamic power is one of the major contributors to 
the total power consumption in the SRAM cell. The 
dynamic power consists mainly of two components - write 
power and read power. These power dissipations occur due 
to charging/discharging of large bitline capacitances. The 
bitline capacitance consists of source/drain diffusion 
capacitance of access transistor, interconnect capacitance 
and contact capacitance. The expression for bitline 
capacitance is given by 



Cbl - Cblb - Csi 



S I D,cap 



c +r 



(7) 



The dynamic power dissipation during write operation 
contributes 70% to the total dynamic power dissipation in 
SRAM cell [18]. Write dynamic power dissipation while 
writing to a cell is given by 



C BL X ^BL X ^DD X ^WRITE 



(8) 



variations affect the performance and reliability of a 
design. 

A. Write Access Time 

Write access time (T WA ) is estimated at different 
voltages while writing "1" @ QB. Table III presents the 
measured r WA of both the designs. Fig. 9 plots the 
simulated results for making the comparison easier. The 
r WA is estimated as the time required for writing "1" at QB 
from the point when WL reaches 50% of its full swing 
from its initial high level to the point when QB rises to 
90%) of its full swing from its initial low level. As 
mentioned in Section IV- A, for writing a bit @ QB, the 
write operation is completed after a two inverter delays 
when QB settles to its new state. Therefore, the r WA is 
longer than 7^ (read access time) as shown in Section V- 
D. To understand why the T WA of SEP-6T is lower than 
that of Dif-6T, remember that, V t of MP2 (K tp0 = - 0.5808 
V) of Dif-6T is lower (absolute value) than that of MP2 
(^tpo = - 0.63 V) of SEP-6T, Thus, MP2 of Dif-6T is 
stronger than that of SEP-6T. This prevents MN4 of Dif-6T 
to drop Vq quickly. Another reason of lower write delay in 
SEP-6T is that the MP3 (K tp0 = - 0. 3 V) of SEP-6T passes 
a strong "1" from BLB to drive INV2 (inverter 2) harder 
and MP2 of SEP-6T is also made weaker by increasing its 
V t (absolute value). This helps MN2 to quickly pull down 
storage node Q, which in turn drives INV1 and quickly 
flips QB. As observed from the Table III, proposed design 
offers 29.7% improvement in r WA at nominal voltage of 
K DD =1V. 

TABLE III. 
Write Access Time 



where a B L is the switching activity factor, F WRITE is writing 
frequency to SRAM cell. Writing to a conventional Dif-6T 
cell (Fig. 2) requires discharging (from initial high value) 
of one of the bitlines per write operation whereas the 
proposed design requires charging (from initial precharged 
GND value) of its only bitline (BLB) while storing "1" at 
storage node QB (or "0" @ storage node Q). This reduces 
ocbl to < 0.5. It is therefore, obvious from (8), that Pwrite is 
improved by more than 50% if proposed design is used 
replacing Dif-6T. 

V. Simulation Measurement Results and Discussion 

This Section presents measurements of various design 
metrics which are measured during simulation on HSPICE 
using BPTM 32 nm technology [19]. Monte Carlo 
simulations are performed for the measurements. During 
Monte Carlo simulation, process parameters such as L 
(channel length), W (width), t ox , (oxide thickness), u (zero 
bias carrier mobility) and R a (sheet resistance) are varied 
by ±10%) with Gaussian distribution function. The 
temperature is varied from 24°C to 134°C as well using 
absolute Gaussian function. Simulation measurements are 
taken for both the designs applying ±3 a variation. Monte 
Carlo simulation is a method for iteratively evaluating a 
design. The goal is to determine how random PVT 
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Figure 9. Write access time while writing "1" QB 
Write Power Measurement 



Write dynamic power defined in (8) is caused due to 
charging/discharging of bitline. Dynamic power dissipation 
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during accessing a cell through its devices is generally 
ignored, though it contributes a large portion to the total 
dynamic power consumption. During write access in SEP- 
6T, while writing "1" at QB, K DD finds path to GND 
through two pull-down transistors MN1 and MN2 (Fig. 5). 
BLB is asserted high for writing "1" @ QB. BLB is 
connected to Q2 through MP3 while writing. The supply at 
BLB does not find path to GND. Therefore, major dynamic 
power dissipation during transition (accessing) is through 
MN1 and MN2. Thus, total average write transition power 
of SEP-6T is given by 



Pw,SEP-6T ~PaVG,MN1 + ^AVG,MN2 



(9) 



where Pavg,mni and .Pavg,mn2 are average write transition 
power through MN1 and MN2 while writing "1" @ QB. In 
case of Dif-6T, BLB is biased at V D d and BL is grounded 
by write circuit (not shown). BL pulls-down storage node 
Q below switching threshold of INV1 and finally flips the 
cell to write a "1" @ QB. During this transition, V DD finds 
path to GND through three transistors MN1, MN2 and 
MN4. Therefore, total average write transition power of 
Dif-6T is given by: 



Pw,Dif-6T - PaVG,MN\ 



1 AVG,MN2 



1 AVG,MN4 



(10) 



where P A vg,mni, ^avg,mn2 and P A vg,mn4 are average write 
transition power through MN1, MN2 and MN4 while 
writing "1" @ QB. Total average write transition power, 
i.e., ZVsep-6t and iVoif-6T, commonly abbreviated as JFpwr 
is estimated at different voltages while writing "1" @ QB 
and the results are plotted in Fig. 10, which shows that the 
SEP-6T outperforms Dif-6T at all supply voltages. 
Equations (9), (10) and the above discussion clarify the 
reason of its superiority . As o bserved from the Fig. 10, 
proposed design offers 38.5% improvement in J^Wr at 
nominal voltage of V DD = 1 V. 

C. Write EDP Measurements 

Reduction of EDP is a good direction for optimizing 
any digital circuit including SRAM cell. Since EDP reflects 
the battery consumption (E) for completing a job in a 
certain time (D), it is an important design metrics. 
Therefore, EDP of SEP-6T and Dif-6T are estimated and 
results are compared. The write energy delay product 
(^edp) is estimated using write delay and write power 
while writing "1" @ QB. The Wed? of SEP-6T is defined as 
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The Wed? of Dif-6T is defined as 



W, 



EDP,Dif-6T 



= P* 



xT" 



W,Dif-6T A 1 WA,Dif-6T 



(ii) 



(12) 
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Figure 10. Total average write transition power versus V DD 
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Figure 1 1 . Write EDP ( ^edp) estimated at different voltages while writing 
"1" @ QB versus V DD 
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Figure 12. Write EDP (JFedp) variability estimated at different voltages 
while writing "1" @ QB versus V DD 

The measurements of W E m are taken at various 
operating voltages starting from nominal voltage of V DD = 1 
V down to the minimal operating voltage of 0.8 V beyond 
which write failure begins to occur under the used writing 
condition (if write condition is changed write failure may 
not occur). The results are plotted in Fig. 11, which shows 
that the SEP-6T outperforms Dif-6T at all supply voltages. 
As observed from the Fig. 11, proposed design particularly 
offers 69.6% improvement in W^ m at nominal voltage of 
K DD =1V. 

Write EDP (J^edp) variability (o/u) is estimated at 
various supply voltages and the results are plotted in Fig. 
12. As observed from Fig. 12, fluctuations in W E m of Dif- 
6T at lower voltages are quite satisfactory, but it shoots up 
at 0.95 V, whereas variations in SEP-6T are moderate at all 
supply voltages. To make both the circuits variation 
immune in terms of Wed?, they are to be operated at 0.9 V. 
As observed from the Fig. 12, proposed design offers 
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26.3% improvement in W^qv variability at nominal voltage 
ofF DD =lV. 

D. Read Access Time 

As mentioned in Section IV- A, for single ended read 
operation an inverter can be used to sense the BLB, but 
other single ended sensing mechanism could decrease the 
read delay. Read access time (Tra) is estimated at different 
voltages with QB storing "1" for both the design at iso- 
measuring condition (i.e., with equal WL slew rate, on time 
and same operating voltage). The measured results are 
reported in Table IV. The results are also plotted in Fig. 13 
for making comparison easier. 

The Tra (for Dif-6T) is estimated as the time required 
for reading with QB storing "1" from the point when WL 
reaches 50% of its full swing from its initial low level to 
the point when BL falls by 10% of its full swing from its 
initial high level. The Tra (for SEP-6T) is estimated as the 
time required for reading with QB storing "1" from the 
point when WL reaches 50% of its full swing from its 
initial high level to the point when BLB rises to 90% (so 
that inverter connected to BLB to sense its voltage is able 
to do so without fail) of its full swing from its initial low 
level. As observed from Fig. 13, SEP-6T outperforms Dif- 
6T at all supply voltages except at nominal voltage of V DD 
= 1 V (where T^ of SEP-6T equals T^ of Dif-6T). 



TABLE IV. 
Read Access Time 



SRAM 


Tra with QB storing "1" 

(s) 


(V) 


Dif-6T 


1.667e-ll 


1 


1.913e-ll 


0.95 


2.255e-ll 


0.90 


2.764e-ll 


0.85 


3.574e-ll 


0.80 


5.040e-ll 


0.75 


SEP-6T 


1.644e-ll 


1 


1.790e-ll 


0.95 


1.984e-ll 


0.90 


2.242e-ll 


0.85 


2.588e-ll 


0.80 


3.028e-ll 


0.75 



x 10 



--A-- Dif-GT 
— e— SEP-6T 






0.75 0.S 0.S5 0.9 

V DD (Volts) 



0.95 



of Dif-6T whereas in case of SEP-6T, C BLB is charged 
through low- K t MP3 (K tp0 = - 0. 3 V). It implies that MN4 
does not get on till 0.63 V is applied at its gate through 
WL, whereas only |0.3 V| applied through WL is sufficient 
to put MP3 on. Since slew rate of WL is same for both the 
cases C B lb begins to charge up earlier than C BL begins to 
discharge down. 

E. Read Power Measurement 

Read power is measured with QB holding "1". During 
read access in SEP-6T, V DD finds path to GND through two 
pull-down transistors MN1 and MN2 and storage node 
holding "1" charges C BLB through MP5 and MP3 (Fig. 5). 
Therefore, total average read transition power of SEP-6T is 
given by 



AVG,MN\ + Pr 



+ P* 



(13) 



where P/?_avg,mni and P RA vg,mn2 are average read 
transition power through MN1 and MN2 and P BLB is power 
dissipated for charging C BLB while reading with QB storing 
"1". The P BLB per read operation is given by 



Pblb - Cblb x Vdd x ft 



(14) 



where f t is transition or read frequency. In case of Dif-6T, 
BL and BLB are biased at V D d before each read operation. 
During read transition K DD finds path to GND through 
MN1 and MN2. C BL discharges through MN2 via storage 
node Q. Therefore, total average read transition power of 
Dif-6T is given by 



r R,Dif-6T 



AVG,MN\ + Pr_ 



AVG,MN2 



(15) 



where P R _ AV g,mni and P RA vG,Mm are average read 
transition power through MN1 and MN2 while reading 
with QB storing "1". 

Total average read transition power, i.e., Pr_sep-6t and 
^R_Dif-6T> commonly abbreviated as R?wr is estimated at 
different voltages while reading with QB storing "1" and 
the results are plotted in Fig. 14. As observed from Fig. 14, 
the proposed design outperforms the conventional Dif-6T 
at all higher supply voltages staring from 0.85 V 




0.75 



0.S5 0.9 0.95 
V DD (Volts) 



Figure 13. Read access time while writing "1" QB 

To understand why the Tra of SEP-6T is lower than that of 
Dif-6T, remember that, large C BL discharges (while reading 
with QB holding "1") through high-K t MN4 (K tn0 = 0.63 V) 
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Figure 14. Total average read transition power ( Rpwr) estimated at 
different voltages while reading with QB storing "1" versus V DD 

to nominal voltage of 1 V. At lower voltages such as 0.75 
V and 0.8 V, Dif-6T outperforms SEP-6T. As o bserved 
from the Fig. 14, proposed design offers 5.6% 
improvement in R?wr at nominal voltage of V DD = 1 V. 
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F. Read EDP Measurements 

The Read Energy Delay Product (7?edp) of SEP-6T and 
Dif-6T are estimated and results are compared. 7? EDP is 
estimated using read delay and read power while reading 
with QB holding "1". The R ED ? of SEP-6T is defined as 



ReDP,SEP-6T ~ Pr,SEP-6T X ^RA,SEP-6T 



The i?EDP of Dif-6T is defined as 



R 



EDP,Dif-6T - ^R,Dif-6T X ^RA,Dif-6T 



(16) 



(17) 



The measurements of i? EDP are taken at various supply 
voltages starting from nominal voltage of F D d = 1 V down 
to the minimal supply voltage of K DD = 0.75 V beyond 
which read failure begins to occur under the used reading 
condition (if read condition is changed read failure may not 
occur). The results are plotted in Fig. 15. As observed from 
Fig. 15, SEP-6T outperforms Dif-6T at all supply voltages. 

G. Static Noise Margin Measurements 

The SNM of SRAM cell is defined as the minimum DC 
noise voltage necessary to flip the state of the cell. SNM of 
an SRAM is a widely-used design metric that measures the 
cell stability. Fig. 16 and 17 show conceptual test setup for 
measuring SNM for SEP-6T and Dif-6T respectively. The 
measured results when plotted is called "butterfly curve". 
Fig. 1 8 plots the combined "butterfly curve" of Dif-6T and 
proposed SEP-6T at iso-measuring condition. The butterfly 
curve for SEP-T is obtained in the following way using the 
test circuit: 1) W and BLB are biased at GND voltage and 
WL is biased at supply voltage. 2) Voltage of Nl is swept 
from V to supply voltage while measuring voltage of QB. 
3) Voltage of N2 is swept from V to supply voltage while 
measuring voltage of Q in the same way. 4) Measured 
voltages are plotted to get butterfly curves. The side length 
of Maximum Square that can be embedded within the 
smaller lobe of the butterfly curve represents the SNM of 
the cell. This definition holds good because, when the 
value of noise voltage (V m or K N2 ) increases from 0, the 
VTC (voltage transfer characteristic) for INV1 formed with 
MP1 and MN1 moves to the right and the VTC" 1 (inverse 
VTC) for INV2 formed with MP2 and MN2 moves 
downward. Once they both move by the SNM value, the 
curves meet at only two points and any further noise flips 
the cell [20]. As observed from Fig. 18, SEP-T shows 
22.5% penalty in SNM. 

H. Comparitive Study of Cell A rea 

In multi-port design, area would be the major concern. 
As multi-core uPs, SoCs, and NoCs (network-on-Chip) 
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Figure 15. Read EDP versus V DD 
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Figure 16. Test circuit for measurement of SNM of SEP-6T 
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Figure 17. Test circuit for measurement of SNM of Dif-6T 



■A--Dif-<5T 
-SEP-6T 




Figure 18. Static noise margin of Dif-6T and proposed SEP-6T cell 

will dominate future technological innovation, on-chip 
cache deigned with high density and low bitline switching 
power is an attractive choice. In 65 nm technology node, 
RD-8T (read decoupled 8T) SRAM cell proposed in [21], 
isolates read port from storage node thereby increasing read 
stability (without improving write margin significantly) at 
cost of 30% area penalty compared to dense 6T cell (which 
might me used for higher level caches). Differential data- 
aware power-supplied D 2 AP 8T cell with a boosted bitline 
scheme proposed in [22], incurs an area overhead similar to 
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RD-8T cell. This design also slightly degrades hold static 
noise margin (SNM) due to stacking effect. Authors in [23] 
proposed a 10T SRAM cell which solves column 
interleaving problem without any extra access delay but at 
the expense of large area penally. A 32-bit word with 
single-ended 6T SRAM bitcell in [24] consumes 16% 
(including the read and write assist transistors) larger area 
than standard 6T bitcell and 14% less than 8T proposed in 
[25]. In view of area constraint in future on-chip-cache, this 
paper proposes a novel single-end 6T SRAM cell that 
consumes less area on chip compared to standard Dif-6T 
considering the layout of bitline routing. 

VI. Conclusion 

This paper presents a PFET-based single ended 6T 
SRAM cell which is radiation hardened and capable of 
saving write dynamic power. It has successfully 
demonstrated that the proposed design is robust against the 
impact of process, voltage and temperature variations in 
terms of tighter spread in write EDP. The simulation 
measurement results confirm the successful functionality 
and read/write stability of the proposed design. The 
benefits of read and write power reduction make it an 
attractive choice for applications where static noise is 
negligible. 
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